Evolutionary reinforcement learning via cooperative coevolutionary negatively correlated search
نویسندگان
چکیده
Evolutionary algorithms (EAs) have been successfully applied to optimize the policies for Reinforcement Learning (RL) tasks due their exploration ability. The recently proposed Negatively Correlated Search (NCS) provides a distinct parallel search behavior and is expected facilitate RL more effectively. Considering that commonly adopted neural usually involves millions of parameters be optimized, direct application NCS may face great challenge large-scale space. To address this issue, paper presents an NCS-friendly Cooperative Coevolution (CC) framework scale-up while largely preserving its behavior. issue traditional CC can deteriorate also discussed. Empirical studies on 10 popular Atari games show method significantly outperform three state-of-the-art deep methods with 50% less computational time by effectively exploring 1.7 million-dimensional
منابع مشابه
Cooperative Coevolutionary Ensemble Learning
A new optimization technique is proposed for classifiers fusion — Cooperative Coevolutionary Ensemble Learning (CCEL). It is based on a specific multipopulational evolutionary algorithm — cooperative coevolution. It can be used as a wrapper over any kind of weak algorithms, learning procedures and fusion functions, for both classification and regression tasks. Experiments on the real-world prob...
متن کاملA Comparison of Evolutionary and Coevolutionary Search
We present a comparative study of an evolutionary and a coevolutionary search model. In the latter, strategies for solving a problem coevolve with training cases. We find that the coevolutionary model has a relatively large efficacy: 41 out of 50 (82%) of the simulations produce high quality strategies. In contrast, the evolutionary model has a very low efficacy: 1 out of 50 runs (2%) produce h...
متن کاملCoevolutionary networks of reinforcement-learning agents
This paper presents a model of network formation in repeated games where the players adapt their strategies and network ties simultaneously using a simple reinforcement-learning scheme. It is demonstrated that the coevolutionary dynamics of such systems can be described via coupled replicator equations. We provide a comprehensive analysis for three-player two-action games, which is the minimum ...
متن کاملCooperative Inverse Reinforcement Learning
For an autonomous system to be helpful to humans and to pose no unwarranted risks, it needs to align its values with those of the humans in its environment in such a way that its actions contribute to the maximization of value for the humans. We propose a formal definition of the value alignment problem as cooperative inverse reinforcement learning (CIRL). A CIRL problem is a cooperative, parti...
متن کاملIntegrating System Optimum and User Equilibrium in Traffic Assignment via Evolutionary Search and Multiagent Reinforcement Learning
Traffic assignment is fundamentally a tool for transportation planning. It allocates trips within the traffic network. However, modern uses of traffic assignment also include shorter time horizons and even real-time use (e.g., for route recommendation). In the latter case, it is interesting to recommend routes that are as close as possible to the system optimum. To compute an approximation of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Swarm and evolutionary computation
سال: 2022
ISSN: ['2210-6502', '2210-6510']
DOI: https://doi.org/10.1016/j.swevo.2021.100974